Extreme value statistics and return intervals in long-range correlated uniform deviates 
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We study extremal statistics and return intervals in stationary long-range correlated sequences for 
which the underlying probability density function is bounded and uniform. The extremal statistics 
we consider (e.g. maximum relative to minimum) are such that the reference point from which the 
maximum is measured is itself a random quantity. We analytically calculate the limiting distributions 
for independent and identically distributed random variables, and use these as a reference point for 
correlated cases. The distributions are different from that of the maximum itself (i.e. a Weibull 
distribution), reflecting the fact that the distribution of the reference point either dominates over or 
convolves with the distribution of the maximum. The functional form of the limiting distributions 
is unaffected by correlations, although the convergence is slower. We show that our findings can 
be directly generalized to a wide class of stochastic processes. We also analyze return interval 
distributions, and compare them to recent conjectures of their functional form. 

PACS numbers: 02.50.-r,05.40.-a,05.45.Tp 



I. INTRODUCTION 

Interest in the extreme behavior of complex systems 
has been growing, with examples including the estima- 
tion of DNA replication times [l| , extreme paths on ran- 
dom trees 0, Q and their applications to computer sci- 
ence [3], extreme eigenvalues in random matrices and 
low- lying states in disordered systems Q , and extreme 
values in multifractal processes Q- In particular, much 
attention has been paid to extreme value statistics in 
time series. This is especially relevant in the context 
of disasters and hazard assessment Q, flood prediction 
being a notable example fsj. In regard to this, results ex- 
ist in the mathematical literature [lo| which extend the 
range of extreme limit distributions based on identically 
and independently distributed (iid) random variables to 
a wide class of dependent time series, see [ll| for a re- 
view. Yet, the exact extremal properties of many time se- 
ries exhibiting long-range correlations are far from being 
fully understood. Examples of such time series in nature 
include crackling noise [l!], water levels in rivers [l3|, 
temperature fluctuations in oceans or climatologi- 
cal temperature records [H, [lB| ■ Much insight into the 
extremes of these time series can be gained by studying 
so-called 1//" signals [13, [H, [3 (i'^ reference to the 
power-law decay of the power spectrum), which capture 
many of the essential features of real-world long-range 
correlated time series. 

Aside from the distribution of extremes, another very 
useful and practical indicator for hazard assessment 
based on time series is the distribution of return inter- 
vals between successive threshold-crossing events. For 
uncorrelated time series, the Poisson process gives rise 
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to the well-known exponential return interval distribu- 
tion For long-range correlated series encountered 
in nature, however, the distribution is no longer expo- 
nential and a number of distributions have been put 
forward, depending on the data set, such as gamma 
distributions and pow er-laws with stretched exponential 
tails HH, m m in [p. Artiflciallvgenerated 1//" 
signals have been studied in detail [26r[27t and some 
progress has been made towards a theoretical under- 
standin g of return intervals in long-range correlated se- 
ries 2^ 2^ [13] , but the overall picture is far from com- 
plete. 

Here, we focus on the extremal properties and return 
intervals of stationary long-range correlated series for 
which the underlying probability density function (PDF) 
is uniform over a given finite interval, i.e. the Xi that 
make up the series {Xi}i=i...„ are uniformly distributed. 
These processes are of general interest and have been ne- 
glected in the past. For iid random variables, a single 
parameter family of distributions describes the possible 
limiting distribution for the maximum of a sample [sij . 
This family is traditionally broken up into three qual- 
itatively different distributions: Frechet, Gumbel and 
Weibull. Provided certain conditions are met, a given 
underlying PDF for the iid random variables will fall 
within the domain of attraction of one of these three ex- 
treme value distributions. Previous studies have concen- 
trated on time series with underlying PDF that belong to 
the Gumbel or Frechet domains of attraction. Random 
variables distributed uniformly on [0, 1], however, belong 
to the Weibull domain of attraction [4^. In fact, the 
same is true for a series of stationary long-range corre- 
lated uniform deviates, provided certain mixing criteria 
are met [Tol. [Tl|. However, we show in this paper that a 
variety of extremal quantities converge to very different 
limiting distributions. When the maximum is measured 
relative to a quantity that is itself a random variable, the 
extremal distribution need no longer be Gumbel, Frechet 
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or WeibuU. In this paper we consider the following three 
extremal quantities: (a) maximum relative to the average 
(over the Xi), introduced in the context of interfaces [32], 

(b) maximum relative to the initial value (lol . Issj , which 
is a natural measure in the context of time series (e.g. 
the maximum increase of a stock from its starting price), 

(c) maximum relative to minimum, which is a measure 
of the full range of values encountered (e.g. as measured 
in auroral indices j34|). 

The structure of the paper is as follows: In Section ITTl 
we present analytic results for extremal distributions for 
iid random variables and we briefly discuss how our re- 
sults may be generalised to other distributions belonging 
to the WeibuU class, e.g. the beta distribution. In Sec- 
tion |TTT1 we then compare the iid results to the respective 
distributions in stationary long-range correlated series. 
In Section IIVI we present the corresponding return inter- 
val distributions and examine their asymptotic behavior. 
We conclude the paper in Section |Vl In an Appendix, 
we discuss some of the features of the Schreiber-Schmitz 
algorithm [s^, [sH] for generating long-range correlated 
series with a specified underlying PDF. 



scale parameter), where y := a„z + 6„. That is, 

J zdF"(a„z + 6„) -0 (3) 
J z^dF^{anZ + b„)^l. (4) 



A. Maximum relative to average 

Labelling the maximum as x™"^, the maximum rela- 
tive to the average is defined as z :— xjf"^ — ^ J2^=i ^i- 
By the central limit theorem, the average of n iid uniform 
deviates approaches a Gaussian distribution centered at 
1/2 with a width that shrinks as 0{n~^/'^). The choice of 
a„ = 1/n in Eq. ([2]) shows that the maximum approaches 
1 as an exponential distribution with a width that shrinks 
as 0{n''^). The scaling of a„ is, to leading order, generic 
up to prefactors [Toj. Therefore, as n ^ oo, the spread 
in the average dominates, and the PDF of the maximum 
relative to the average is 

p{y) ^ -^e-y'/\ (5) 



II. EXTREMAL STATISTICS FOR IID 

UNIFORM DEVIATES with zero mean and unit standard deviation. 



Consider a set of n iid random variables {^i, . . . , Xn\ 
drawn from the (cumulative) distribution function F{z). 
The distribution of the maximum A/„ is 

Pr(M„ < z) ^ Pr(Xi < z, . . . , X„ < z) 

= F"(z) (1) 

A fundamental result of extreme value theory is that 
if _F"(a„z -|- 6„) converges to a non-degenerate limit- 
ing distribution as n — > cxd, where a„ and 6„ are scale 
and location parameters, respectively, that effect a lin- 
ear rescaling, the distribution can be one of only three 
types. The eventual limiting distribution is determined 
by the asymptotic behavior of the underlying F{z) [3l| . 
For the specific example of n iid uniform deviates with 
probability density p{z) = 1 for < z < 1, the choice 
a„ = bn = 1 gives 

lim F'^{z/n + 1) ^ lim {z/n + 1)" 

n— i-OO 7l—>-OG 

(2) 

which is an example of a WeibuU distribution. While 
the choice of an and 6„ is not unique, the particular 
choice does not influence the functional form of the limit- 
ing distribution according to the Khinchin theorem . 
It does, however, determine convergence rates [37| . In 
this paper we choose a„ and &„ so that all limiting PDFs 
p{y) — ^^^^ are standardized with zero mean (via the 
location parameter) and unit standard deviation (via the 



B. Maximum relative to initial value 

Given an initial value cci, the maximum relative to the 
initial value is defined as z := — xi. The initial 

value is a random variable with a distribution having 
a width of 0(1). Therefore, by the same argument as 
above, only the spread in the initial value is observed 
for n — !■ cx). Specifically, two situations must be distin- 
guished. With probability the initial value is itself 
the maximum, yielding a zero maximum relative to the 
initial value. Thus the distribution is composed of a point 
mass of weight 1/n at z = 0. For z > 0, the probabil- 
ity distribution is continuous and the calculation of the 
probability density is as follows: without loss of gener- 
ality, label from among the remaining (n — 1) random 
variables the maximum as the final one. Then 

p(z) = (n-l) / dxi--- dXn~l / dxn 
Jo Jo Jo 

n 

X Y\ d{Xi)0{l ~ X^)5{z - {Xn - Xi)) 
1=1 

= l-z"-i, (6) 

where (n — 1) is a combinatorial factor taking into ac- 
count all possible locations of the maximum among the 
remaining [n — 1) random variables. 

Substituting the PDF in Eq. (O into Eqs. ©,(111), the 
location and scale parameters that maintain zero mean 
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and unit standard deviation are 
1 



n 



2 (n + 1) 



5 + 



An 



(7) 



(8) 



Thus, to leading order, the location and scale parameters 
are simply the mean and standard deviation of the uni- 
form distribution. As n — > oo the rescaled PDF of the 
maximum relative to the initial value converges to 



-3/\/3 <y< 3/\/3 
otherwise. 



(9) 



For more general cases, see [li 



C. Maximum relative to minimum 

Without loss of generality, labelling the minimum and 
the maximum as the first and last random variables, re- 
spectively, the distribution of the maximum relative to 
the minimum, z := Xn — xi is given by 

fi 

p{z) = n{n — 1) / dxi / dx2 • ■ ■ / dxn^i I dx„ 



Y[e{x,)e{i-x,)d{z~{xr,-xi)) 

i 

--n{n-l)z^"-^'>{l- z), 



(10) 

where n{n — 1) is a combinatorial factor taking into ac- 
count all possible locations of the maximum and the min- 
imum among the n variables. 

Substituting the PDF in Eq. ^ into Eqs. ©,© the 
location and scale parameters that maintain zero mean 
and unit standard deviation are 
n - 1 2 

+ (11) 



n + 1 
V2 



1 



+ Oin-'). 



(12) 



To leading order, the location parameter reflects the fact 
that the distribution centers at z = 1 from below as n — > 
oo. Meanwhile, the scale parameter reflects the fact that 
the width of the distribution shrinks as 0{n~^) to leading 
order. As n — > oo the rescaled PDF of the maximum 
relative to the initial value converges to 

piy) = V2{2 - V2y)e'^y-'^, -oo < y < V2, (13) 
and zero otherwise. 



D. 



Other distributions belonging to the Weibull 
class 



While the results given in Sections III AHII Cl are for the 
uniform distribution, the arguments used in their deriva- 
tion can be easily generalized to any other distribution. 



Namely, the widths under rescaling of the maximum and 
the reference value should be compared in order to de- 
termine the final limiting distribution for the extremal 
statistic. Generally, either one or the other will domi- 
nate, apart from the special case when both widths are 
rescaled in the same way, in which case there will be a 
convolution of the two distributions (assuming that the 
n-independent prefactors are comparable in magnitude). 
This is in particular true for another prominent example 
in the Weibull class, the beta distribution, with PDF 



p{z) = 



(1-z) 



5-1 



< z < 1, 



(14) 



and zero otherwise, where 7, 5 > and B('j,6) is the 
beta function. The maxima of the beta distribution are 
WeibuU-distributed, provided the width is rescaled with 
a„ ~ n~^/*. Thus, for example, the maximum measured 
with respect to the average yields a Gaussian, a convo- 
lution, and a Weibull distribution for (5 < 2, (5 = 2 and 
(5 > 2, respectively. 



III. EXTREMAL STATISTICS FOR 
STATIONARY LONG-RANGE CORRELATED 
SIGNALS 

The stationary long-range correlated series Xi of 
length n that we study have a uniform underlying PDF 
on [0, 1] and two-time correlation function that decays as 



C\,^{Xaj)-{X,){X,)^\i-j\ 



(15) 



where the average corresponds to an ensemble average. 
Stationarity requires that a < 1 and long-range correla- 
tions are those for which < a < 1. By the Wiener- 
Khinchin theorem, the Fourier transform of the two-time 
correlation function is simply the power spectrum and 
both are equivalent descriptions. For stationary long- 
range correlated series, it follows that the power spec- 
trum decays as 1//". 

In order to generate series of long-range correlated uni- 
form deviates, we employ the algorithm of Schreiber and 
Schmitz [3^, [3^. The method works iteratively to en- 
force a desired power spectrum by permuting iid random 
variables drawn from a desired distribution. Each iter- 
ation consists of two steps: adjusting the power spec- 
trum of the random variables in Fourier space with the 
appropriate filter, and then rank-order exchanging the 
reversed-transformed variables with the original iid ran- 
dom variables. Thus, the algorithm does not change the 
values of the variables drawn initially, but does change 
their order. For the uniform distribution, only a small 
number of iterations is required for the power spectrum 
to converge to the desired form - in our case, a power-law 
with slope —a. More details of the method are outlined 
in the Appendix. 

The statistics for the extremal quantities are collected 
in segments. Typically, we generated series of length 2^^ 
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to 2^^, cut up 2^ to 2^° times to produce blocks of length 
2^^ to 2^^. The process is repeated for a number of gen- 
erated series to produce histograms with 0(10^ — 10^) 
points. The histograms are normalized and rescaled to 
give p{y), with zero mean and unit standard deviation. 

As mentioned in the introduction, the three classes of 
extreme distributions for the maxima of iid random vari- 
ables remain robust for a wide class of dependent se- 
ries [13], including, for example, long-range correlated 
Gaussian series [38|. Numerically, we find that this is 
also true for the long-range correlated uniform processes 
defined above. This suggests that the relevant mixing 
conditions are satisfied for the case of long-range cor- 
related uniform deviates, although we have not checked 
this analytically [43|. Moreover, while there are strong 
finite size corrections to the variation of the scale param- 
eter a„ with n, even the asymptotic behavior or leading 
order scaling is identical to the iid = 1/n. Our 

numerics indicate that for small system sizes the higher 
order corrections are positive. Such a behavior is not 
generally expected since, owing to correlations, the lead- 
ing order scaling of a„ may be different than those in 
the iid case [l0|. In the following, we show that analo- 
gous results hold for the three relative maxima, i.e., the 
distributions derived in Eqs. . (P) . (fT^ for iid random 
variables also apply to the dependent case. 



A. Maximum relative to average 

Fig. [T] plots the maximum relative to the average 
for blocks of length 2-'^^, 2^'^, 2^^ (triangles, crosses, di- 
amonds) and a — 0.5. The curves are barely distinguish- 
able from the iid limiting Gaussian distribution, and the 
convergence is rapid for the system sizes examined. Nu- 
merically, we find that the same holds for < a < 1 (not 
shown). These results are expected since the width of 
the limiting Gaussian distribution of the average decays 
as n("-i)/2 [S, 40] while the width of the distribution of 
the maximum decays asymptotically as as mentioned 
above. Thus, the average dominates for any a implying 
that a Gaussian distribution is observed asymptotically. 




FIG. 1: (Color online) Distribution of maximum relative to 
average for sequences of length 2^^^, 2^'', 2^^ (triangles, crosses, 
diamonds) and a — 0.5, with zero mean and unit standard 
deviation. A Gaussian is plotted alongside as a solid line. 
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FIG. 2: (Color online) Distribution of maximum relative to 
initial value for sequences of length 2", 2", 2^^ (triangles, 
crosses, diamonds) and a = 0.5, with zero mean and unit 
standard deviation. A uniform distribution is plotted along- 
side as a solid line. 



B. Maximum relative to initial value 

Fig. [2] plots the maximum relative to the initial value 
for blocks of length 2-'^^, 2^"^, 2^^ (triangles, crosses, dia- 
monds) and a — 0.5. The curves closely follow a straight 
horizontal line of height •\/3/6. We find the same be- 
havior for other a with < a < 1 (not shown). If we 
assume that the distribution approaches the same limit 
as in the iid case, then the exact form for finite n is given 
by Eq. ([6]). The function approaches zero at its upper 
endpoint very rapidly, and this feature is also observed 
in Fig. [5] for the dependent case. Since the distribution 
consists of a point mass at zero maximum relative to the 



initial value, it is particularly convenient to analyze this 
behavior with increasing a. Fig. [3] demonstrates that 
the fraction p of times when the initial value is also the 
maximum decreases as 1/n. However, the amplitude of 
the decay increases with a. An intuitive explanation for 
this behavior is as follows: persistence along the series 
increases with a, i.e. the series has memory and is more 
likely to continue in the same direction as previously. A 
series that starts with a downward trend is more likely 
to produce a zero maximum relative to the initial value. 
Such persistent trajectories are more prevalent with in- 
creasing a, and therefore the amplitude of the point mass 
is enhanced relative to the iid case. 

In the presence of correlations, convergence to limit- 
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FIG. 3: (Color online) Decay in the fraction of zero maxima 
relative to initial value with increasing system size, for a — 
0.95,0.8,0.6,0.4,0.2 (top to bottom). The decay 1/n for iid 
series is shown by the solid black line. 



ing distributions is typically slower than in the iid case 
[4ll |. The effect of increasing a (i.e. correlations) can 
be taken into account by replacing the block size n by 
a smaller size rics — C{a)n, where C{a) < 1 is an a- 
dependent constant. If we assume a similar behavior for 
the maximum measured relative to the minimum, then 
the simplest modification to Eq. ^ for correlated series 
would consist of a point mass with weight l/(C(a)n) at 
zero, together with a density 



p{z) = 1 - 



(16) 



for z > 0. Fig. m plots estimates of C{a) for various a, 
suggesting that a decreased effective degrees of freedom 
description is valid. 



C. Maximum relative to minimum 

Fig. [5] plots the maximum relative to the minimum 
for blocks of length 2^^, 2^^, 2^^ (red, green, blue) and 
a = 0.5. Convergence is evidently slower for this maxi- 
mal quantity and worsens with increasing a (not shown) . 
We speculate that this is because the maximum and min- 
imum approach their respective limiting distributions at 
the same rate, giving rise to the convoluted distribution 
derived in Eq. (fTO]) in the iid case. 



FIG. 4: (Color online) Measurements of C(a) based on 
Eq. HSI for a = 0.2, 0.4, 0.6, 0.8, 0.95 (top to bottom) and 
n = 64, 128, 256, 512, 1024, 2048. For each a, the lines are av- 
erages over the six system sizes. Error bars are estimates of 
statistical error (via standard error propagation) and repre- 
sent one standard deviation: over A'^ measurements the num- 
ber of instances of the maximum also being the initial value 
is binomially distributed with expectation Np and variance 
A''p(l — p), where p = 1/n is the probability that the maxi- 
mum lies at the initial value. 
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FIG. 5: (Color online) Distribution of maximum relative 
to minimum for sequences of length 2^^, 2^^, 2^^ (triangles, 
crosses, diamonds) and a = 0.5, with zero mean and unit 
standard deviation. The distribution in Eq. (|13|) is plotted 
alongside as a solid line. 



D. Otiier distributions belonging to the Weibull 
class 

For long-range correlated processes obeying the beta 
distribution defined in Eq. (jl4[) , the same picture emerges 
as above once the relative rescalings of the maximum 
and the respective reference value are taken into account. 
Our numerics ^425 indicate that the distribution of the 



maxima and the minima both approach a Weibull dis- 
tribution and that a„ scales asymptotically as n^^/* for 
the maximum and n^^/'^ for the minimum — as in the 
iid case. As in the case of the uniform distribution, the 
width of the limiting Gaussian distribution of the aver- 
age scales as n^"~^^/^. Thus, depending on the particu- 
lar choice of a, 7 and S, different limiting distributions 
are obtained. For example, for j — 5 — 2 the extremal 
statistics obey respectively a Gaussian, a beta distribu- 
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tion with 7 = 5 = 2, and a convoluted distribution for the 
maximum measured with respect to the average, initial 
value, and minimum. 

In general, if a given WeibuU process satisfies the rele- 
vant mixing conditions [10] and a„ scales asymptotically 
like its uncorrelated counterpart, then, barring slow con- 
vergence, one of three scenarios will occur: (a) the width 
of the maximum distribution shrinks slower than that 
of the reference, in which case the maximum distribu- 
tion is observed, (b) the width of the reference distribu- 
tion shrinks slower than that of the maximum, in which 
case the reference distribution is observed, (c) the widths 
of the maximum and reference distributions scale at the 
same rate and are of comparable amplitude, in which case 
a non-trivial convolution is observed. 



IV. RETURN TIMES 

Return times are useful indicators for analyzing time 
series and are particularly relevant when forecasting ex- 
treme events, e.g. floods or large earthquakes. For ex- 
ample, a flood levee may be constructed so as to permit 
flooding only once every 10, 000 years, on average. Sup- 
posing that at time ti an event of magnitude cc, exceeds 
some threshold q, and that the threshold is subsequently 
exceeded for the first time by an event Xj at tj , then the 
return time is defined as t :— tj — ti. Since in our context 
time marches in unit steps, the smallest return time is 1, 
i.e. as a result of two consecutive events that exceed the 
threshold. 

In our simulations we generate series of length 
2^^, . . . , 2^^ a repeated number of times (~ 10^ to 10^). 
The return times derived from these series are then com- 
bined into a histogram, from which we construct the nor- 
malized return time distribution p{t). 

The iid scenario is described by a Poisson point pro- 
cess, giving exponentially distributed return times. For 
stationary long-range correlated series, meanwhile, the 
authors of [26l W\ proposed the following fit-free (but 
nevertheless a-dependent) ansatz for the return time dis- 
tribution: 

Tp(i) =a„exp[-5„(t/T)i-"], (17) 

where r is the mean return time, and and ba are a- 
dependent constants of normalization fixed by 

oo 

CO 

J2tp{t)=T. (19) 

t=l 

We test this stretched exponential ansatz for long-range 
correlated uniform deviates. Fig. [6Ka),(b) plots numer- 
ical results for a = 0.2 and two different quantiles. 
Stretched exponential curves are drawn alongside for 
a = 0.25,0.2,0.15 from top to bottom. In Fig. [HKa) the 



errors are within the symbol size up to t/r « 16, with 
larger statistical fluctuations thereafter. For Fig. HKb), 
errors remain within symbol size up to t/r « 13. The 
agreement between the numerical results and the ansatz 
is poor. Increasing the quantile worsens the agreement. 
For the results shown, there appears to be no system- 
atic trend towards the ansatz with increasing system 
size. Figs. [nic),(d) plot numerical results for a = 0.7 
and two different quantiles. Stretched exponential curves 
are drawn alongside for a = 0.75,0.7,0.65 from top to 
bottom. In Fig. [HKc) the errors are within the symbol 
size up to t/r ~ 140, with larger statistical fluctuations 
thereafter. For Fig. [IKd), errors remain within symbol 
size up to t/r « 50. The agreement between the numer- 
ical results and the ansatz is better than at a = 0.2, but 
still unsatisfactory. Increasing the quantile pushes the 
numerical results further away from the a — 0.7 ansatz, 
although there does seem to be an improvement in the 
overall shape. Increasing the system size does reveal a 
trend towards the ansatz curve for a = 0.7, although 
much larger system sizes would be required to examine 
whether this trend converges. A similar picture is ob- 
tained for the beta distribution (4^ . i.e. poor agreement 
between numerics and ansatz that worsens with increas- 
ing quantile. 

These observations are confirmed by plotting 

— ^ \og{Tp{t) / Ga) against t/r. If the proposed 

form of a stretched exponential is correct, then the 
numerical curves should approach a constant asymptoti- 
cally [26]. The main plots in Figs. 17181 show the results 
for a — 0.2, 0.7. The numerical curves bend around the 
horizontal line, suggesting that a stretched exponential 
is too simplistic to describe the functional form of the 
asymptote. We find that the agreement with the ansatz 
improves with increasing a. For the most part, the 
dependence on series length is weak for the series we 
considered. 

The particularly strong deviations from the proposed 
stretched exponential given in Eq. p?)) for t < t have also 
been observed for other stationary long-range correlated 
series [l^j. The authors conjecture that instead the initial 
asymptote rather follows a power law with slope —a. We 
test this hypothesis in the insets of Figs. I7I8I for a = 
0.2,0.7 respectively. The agreement with the power-law 
ansatz is reasonable and improves with decreasing a and 
increasing quantile. 

To summarize, while it is still possible that the ansatz 
for the return time distribution of stationary long-range 
correlated series given in Eq. (|17p generally holds for 
t > T in the limit of infinitely long time series, there 
are significant deviations from it for the long but finite 
time series considered here. These deviations also depend 
on the threshold in a counter-intuitive way such that the 
deviations are stronger for higher thresholds. 





FIG. 6: (Color online) Top panels: distribution of return times above the (a) 97.9%, (b) 99.7% quantile for sequences of length 
2^®, 2^^ (triangles, crosses) and a = 0.2, with mean return times (a) r ~ 48, (b) r ~ 287. Stretched exponential ansatzes (see 
Eq. (|17[) ) are plotted alongside for a — 0.25,0.2,0.15 from top to bottom. Bottom panels: distribution of return times above 
the (c) 97.9%, (d) 99.7% quantile for sequences of length 2^^, 2^^ (triangles, crosses) and a = 0.7, with mean return times (c) 
T « 48, (d) T « 287. Stretched exponential ansatzes are plotted alongside for a = 0.75, 0.7, 0.65 from top to bottom. 



V. CONCLUSION 

Long-range correlated series with an underlying distri- 
bution belonging to the WeibuU class of extreme values 
have received little attention in the literature. We be- 
lieve that our study will be of relevance to certain quan- 
tities that approach finite bounds sufficiently slowly (for 
a precise mathematical statement, see ^J). Possible ex- 
amples include humidity G3| , strength of materials [i^l , 
and critical path analysis [4^. In the latter case, the beta 
distribution is often used for modelling completion times 
of activities. 

While in the mathematical literature one can find im- 
portant extensions of iid extreme value statistics to sta- 
tionary dependent series [13, [HI , we have focussed on ex- 
tremal quantities that are measured with respect to a ref- 
erence point that is itself a random variable. Apart from 
the mathematical interest, this is motivated by practical 



applications. To pick just one of the examples, the maxi- 
mum relative to the minimum gives a measure of the full 
range traversed by a trajectory in a stochastic process. In 
many cases, the distribution of the reference point dom- 
inates or convolves with the distribution of the extremes 
giving rise to extremal distributions very different from 
the WeibuU distribution. Moreover, we found that the 
extremal distributions in the correlated stationary series 
converge to their iid counterparts for the specific pro- 
cesses considered. 

While it was proposed that the form of return interval 
distributions in correlated stationary series asymptoti- 
cally approaches a stretched exponential for large return 
intervals independent of the underlying distribution [27| , 
our results for the uniform and beta distribution do not 
support this. Although a stretched exponential is appeal- 
ing and fully-determined once a has been estimated, we 
believe that this can only be a first approximation. How- 



8 




FIG. 7: (Color online) Main plot: distribution of return times 
above the 99.7% quantile for sequences of lengths 2^^ (tri- 
angles, crosses) for a = 0.2. Inset: distribution of return 
times above the 99.1%, 99.7% quantiles (triangles, crosses) for 
sequences of length 2^^ and a = 0.2. A power law with slope 
—a is plotted alongside. 



FIG. 9: (Color online) Distribution of step sizes for series 
of length 32768 and a — 2. A Gaussian and a symmetric 
exponential, both with the same standard deviation as the 
data, have been plotted alongside as solid lines. 




FIG. 8: (Color online) Main plot: distribution of return times 
above the 99.7% quantile for sequences of lengths 2^^, 2'^^ (tri- 
angles, crosses) for a = 0.7. Inset: distribution of return 
times above the 99.1%, 99.7% quantiles (triangles, crosses) for 
sequences of length 2^^ and a = 0.7. A power law with slope 
—a is plotted alongside. 



ever, a power-law decay with slope —a for short return 
intervals is more convincing. Together with the results 
presented in Ref. [13], this suggests that this behavior 
might be universal and independent of the underlying 
distribution. 
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VI. APPENDIX 

The numerical results presented in this article are 
based on an algorithm proposed in Refs. [s^ [s^ to gen- 
erate realizations of stochastic processes with a desired 
PDF and desired correlations. While the algorithm has 
been extensively tested and is well-established, questions 
remain as to the nature of the sample paths, the step size 
distribution, etc. To investigate these points, we apply 
the algorithm to the case of a uniform PDF with a — 2, 
which should correspond to a diffusion-like process. This 
is indeed what we find. 

In Fig. [9] we plot the distribution of the step sizes 
between successive points in series of length 32768 and 
100 = 2. The distribution lies somewhere between a Gaus- 
sian and a symmetric exponential, i.e. distributions with 
suppressed tails (as opposed to, say, power-law tails aris- 
ing from an underlying Levy motion). 

To minimize the effect of the finite support of the uni- 
form distribution, we also studied the time evolution of 
those realizations or series beginning in the middle of the 
interval. The inset of Fig. [10] plots the standard devi- 
ation of the distribution of these series as a function of 
time. Numerically, among our ensemble of series we in- 
cluded all those that began with values in the narrow 
interval 0.475 < a; < 0.525. Since the random variables 
are drawn from a uniform distribution, at time t = their 
distribution has standard deviation a{t = 0) = \/3/120, 
which is indicated by the lower horizontal line. For in- 
termediate times the distribution spreads with a stan- 
dard deviation a{t) oc y/t. For long times the distribu- 
tion crosses over to a uniform spread across the interval, 
as indicated by the upper horizontal line with standard 
deviation a{t — > oo) — \/3/6. This is consistent with 
diffusion. However, an idiosynchracy of the Schreiber- 
Schmitz method is that the possible values encountered 
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FIG. 10; (Color online) Time evolution ol the PDF for se- 
quences initially starting within 0.475 < x < 0.525 after 
25, 100, 500 steps (top to bottom), for L = 32768 and a = 2. 
Gaussians with the same standard deviation as the data are 
plotted alongside. Inset: Time evolution of the standard de- 
viation of the PDF (crosses) for sequences initially starting 
within 0.475 < a; < 0.525. The standard deviations of the 
initial and final uniform distributions are ^3/120 and ^3/6 
(horizontal lines), respectively. For a range of intermediate 
times the standard deviation grows with the square root of 
the number of steps (diagonal line). 



in the series are quenched at time t = 0. Therefore, pre- 
selected realizations starting in the middle narrow inter- 
val exhaust a fraction of the uniform deviates from this 
region of the [0, 1] interval and, correspondingly, a rel- 
atively larger amount of values is subsequently encoun- 
tered around the endpoints. This is the reason for the 
overshoot of the standard deviation past the upper hori- 
zontal line, since the distribution is slightly enhanced at 
its endpoints (and shghtly diminished around 0.5) with 
respect to a uniform distribution. But after all the values 
in the series have been accounted for, the distribution of 
the original uniform draw is recovered. Fig. 1 101 illustrates 
that, for intermediate times, the evolving distributions 
are well approximated by Gaussians, as expected for dif- 
fusion. 
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